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Optimal Relay Selection with Non-negligible Probing Time 
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Abstract —In this paper an optimal relay selection algorithm 
with non-negllglhle probing time is proposed and analyzed 
for cooperative wireless networks. Relay selection has been 
introduced to solve the degraded bandwidth efficiency problem 
in cooperative communication. Yet complete information of 
relay channels often remain unavailable for complex networks 
which renders the optimal selection strategies impossible for 
transmission source without probing the relay channels. Par¬ 
ticularly when the number of relay candidate is large, even 
though probing all relay channels guarantees the finding of the 
best relays at any time Instant, the degradation of bandwidth 
efficiency due to non-negllglhle probing times, which was often 
neglected in past literature, is also significant. In this work, a 
stopping rule based relay selection strategy is determined for 
the source node to decide when to stop the probing process and 
choose one of the probed relays to cooperate with under wireless 
channels’ stochastic uncertainties. This relay selection strategy 
is further shown to have a simple threshold structure. At the 
meantime, full diversity order and high bandwidth efficiency 
can be achieved simultaneously. Both analytical and simulation 
results are provided to verify the claims. 

Index Terms —Optimal relay selection, stopping rule, diver¬ 
sity gain, probing times 

1. Introduction 

There is a quickly increasing demand for high data rates in 
wireless communication along with the skyrocketing usage 
of mobile devices. Increasing transmission diversities is 
among the most promising techniques and is attracting much 
attention [l]-[3]. Transmission diversity in communication 
systems provides more than one copy of the transmitted 
signal to the destination node, with which the destination 
can decode the transmitted signal even if some copies of 
the signal are distorted due to the time varying nature of 
transmission channels; therefore the system performance can 
be expected to improve significantly. More precisely the 
diversity order of a communication system can be measured 
by the relationship between the error probability, denoted by 
Pe, and the Signal-Noise-Ratio (SNR) as follows : a system 
has diversity order d if 

Pe = 6)(SNR^^). (1) 

Cooperative communication techniques have been intro¬ 
duced to increase system’s diversity order (see [4]). In a 
cooperative network, when a node receives a packet not 
destined for it, instead of simply discarding the packet it can 
choose to help to relay and via such help, the source trans¬ 
mission can improve its diversity by sending signal through 
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the relay channels. Although cooperative communications 
look promising, the gain is not immediate as cooperation 
incurs a cost of wireless resources such as frequency, active 
air time, and power resources to enable relaying. This trade¬ 
off between gaining diversity and conserving the wireless 
resources is mainly two-fold. 

The first aspect is due to signal transmission. Since wire¬ 
less resources are spent for relaying the signal to the destina¬ 
tion, more bandwidth is needed when more relay nodes are 
involved in a transmission. To see this point more clearly, 
if N nodes are relaying the signal, the bandwidth efficiency 
degrades to dues to the fact that there are all-together 
N relays and the source are transmitting the same copy of 
signal. This problem of degraded bandwidth efficiency is 
solved by the introduction of several relay selection protocols 
as detailed in [5]-[17]. With relay selection, only one relay 
instead of all is selected for the purpose of cooperation. 
Therefore, the bandwidth efficiency is leveraged to 5 instead 
of which decreases as the number of relays increases. A 
relay selection protocol is called to achieve full diversity or¬ 
der if the resulting diversity order is A-l -1 for a network with 
N relay nodes. When complete relay channel information is 
available, relay selection protocols with different selection 
metrics have been proved to achieve full diversity order in 
the literature (see [5]-[13]). For example in [12], best relay 
selection (choosing the relay with best channel SNR), best 
worse channel selection (choosing the relay with best worse 
channel condition) and best harmonic mean method have 
been proved to achieve full diversity order. 

The second aspect, which was often neglected, comes 
from channel probing. Due to the dynamic nature of channel 
conditions, channel probing is needed for exploring each 
relay channel’s instantaneous transmission quality in order 
to find the best relay to cooperate with. Specifically we 
consider the following channel probing procedure. Before 
each transmission, the source sequentially probes the chan¬ 
nels between the source and relays, and the channels between 
relays and the destination. The source can stop the probing 
process at any time, and select one of the probed relays to 
cooperate with. For each probing, a carrier sensing packet is 
sent to reveal each relay channel’s instantaneous quality (see 
[18], [19]). Despite the efforts towards reducing the size of 
a sensing packet, the probing time remains non-negligible. 
In this regards, even though probing all channels guarantees 
the discovery of the set of best relays, the degradation of 
bandwidth efficiency due to probing times is significant, 
especially when the number of relays is large. Therefore 
opportunistic channel probing (as commonly defined in the 
Opportunistic Spectrum Access (OSA) [20]) is needed for a 



bandwidth efficient system. 

In this work we design a bandwidth efficient relay selec¬ 
tion strategy in a relay network with non-negligible probing 
time. Technically we adopt theory of optimal stopping rule 
(which has been previously applied to opportunistic spectrum 
access, for instance [21], [22] and references therein.) to 
solve our relay selection problem. Our main contributions 
are as follows. 


• We determine the optimal relay selection strategy and 
we show the optimal strategy is a simple threshold 
enabled stopping rule. 

• Our relay selection strategy achieves full diversity order. 

• This optimal relay selection results in bounded probing 
time, which further implies a relay selection scheme 
with high bandwidth efficiency. 


The rest of our paper is organized as follows. We formulate 
the relay selection problem as an optimal stopping problem 
in Section |I^ In Section III we solve the optimal stopping 
problem and define the corresponding relay selection strat¬ 
egy. We present in Section |IV] the analysis of diversity order 
and bandwidth efficiency of the proposed strategy. In Section 
[V] we verify our results through simulation and conclude our 


paper in Section VI 


II. System Model and Problem Formulation 


A. System model 

We consider a relay network with one source S, one 
destination D and N relays Ri,R 2 , ■■■,Rn (as depicted in 
Fig|^ . We model the channels between any two nodes (S, 
D or Ri,R 2 ,. ■ ■ ,Rn) as discrete time independent Rayleigh 
fading channels. Specifically the direct source to destination 
channel, the N source to relay channels, and the N relay to 
destination channels are modeled as follows 


Ss.d ~ l^s^ds/^s^s “b Vs■^dJ (2) 

y^^fi — “b ft>r 17=1,2,...,V, (3) 

yn,d ~ l^n,ds/~Rr^n ~b ^ ...,V, (4) 

where Pg and Py are the transmission power at the source 
and relays respectively; Xs,x„ are the signals transmitted or 
retransmitted by the source and the nth relay respectively; 
ys,d,yn,dtys,n,n = 1,2,...,N are the received signals at the 
destination D from sources and N relays, and the nth 
relay R„,n = respectively. The channel noises 

Vs,d,'ns,n,'nn.d ^6 modeled as Gaussian random variables 
with zero-mean and variance rjo- 



Fig. 1. The relay network 


assume these channel coefficients remain the same during 
the transmission of one source signal. Moreover, the channel 
coefficients hs^d,hs,n,hn,d,n = 1,2 ,... ,V are modeled as zero- 
mean, complex Gaussian random variables with unit variance 
for each signal transmission. For simplicity of later analysis, 
we use cOgd , (0s,„ ,COn.d,n —1,2,...,N to denote the magnitude 
of the channel coefficients: 

^s,d ~ ; (5) 

(Og^fj = \hg fi I ,n — 1,2,... ,N, (6) 

(On.d = \hn,d\^,n = l,2,...,N. (7) 


For each signal transmission, besides the direct transmis¬ 
sion to the destination, the source node selects one relay 
out of the N candidates to cooperate using perfect Direct 
Forwarding (DF) protocol (for technical details please refer 
to [11]). That is, when relay is selected, the source S 
will first send the signal Xg to both the relay and the 
destination D. Let Xg^,, denotes the decoded signal at R„, and 
the transmitted signal of relay is given by 


x„ 


Xg, if Xg — Xg^i[, 
0 , if ^ Xg^ii. 


( 8 ) 


i.e., the relays will forward the message/signal only if it has 
been correctly decoded. 


B. The Relay Selection Problem 

We formulate the relay selection problem with channel 
probing. For each relay R„ we adopt the following index (On 
introduced in [11] as the criteria for selecting relay, 

(On = -^- , (9) 

(Jl^n.d ~b (l2^s,n 

with q' 1, g'2 being constants asdefinedin[ll]. When complete 
information (all (Og,n,(0„^d^) of all channels is available, it 
is shown in [11] that selecting the relay with maximum (On 
gives full diversity gain, and this is also the major reason we 
adopt (On as the index for relay Rn,n = 1,2,...,N. Moreover 
if we view a)„ as an approximated channel gain of using 
relay R„, Pg- (On becomes the approximated signal power at 
the destination through R„,n = l,...,N. 

Before each transmission, we assume the source sequen¬ 
tially probes the channels between the source and relays, and 
the channels between relays and the destination. The channel 
probing procedure, as shown in Fig. can stop at any stage 
n< N when the channels connected to relays Ri,R 2 ,... ,Rn 
are probed and select one relay Rk,k<n to cooperate with. 
Let Tg be the probing time to probe the channels between 
{S,Rn) and {Rn,D) (for example, Tg could be a cycle of 
RTS/CTS period for IEEE 802.11 channel sensing protocol) 
and Tfnan be the time for signal transmission of the source 
and the selected relay. Then for any n — \,2,...,N, the 
time to probe relays R\,R 2 ,... ,Rn is nTg, and the bandwidth 
efficiency is given by 


Eor all the channels, hgd,hg^n,hn^d,n= 1,2,...,N are co¬ 
efficients that capture the effect of path loss, fading etc. We 


i+nTg 


1 

1+nx' 


( 10 ) 








III. Solution to the Optimal Stopping Problem 


‘ * 



Probe 

Probe R2 


Probe R„ 

S transmits 

Aj; transmits 


Fig. 2. The Channel Probing Procedure 


From the theory of optimal stopping times (see [23, chap. 
2]) , we define the value functions with the information state 
{n„, 1 < n < A^} as follows. 

For stage n = 1,2,... define 


where T = denoting the ratio between the probing time 
and the transmission time. 

We define the signal to noise ratio (SNR) y of the 
system to be y — P/170, where P is the total transmission 
power (including source and the selected relay). Consider 
a constant power scheme and for a fair comparison, the 
sum of transmission power at the source and the selected 
relay should equal the total power P times the bandwidth 
efficiency, i.e. 


P,+Pr = P- 


( 11 ) 


The reason we take total transmission power to be P • c„ in 
o is due to the fact that (1 — c„) fraction of total power 
P is used in channel probing. Denote r as the power ratio 


r= -pjr- Then, 

r-Ln 


Pi = rPc„, 

P, = (l-r)P-c„. 


( 12 ) 

(13) 


When the channel probing process stops at stage n and 
relay channels Oj i,01.92 ,G)i,n are probed, the maximum 
relay index is max<;<„a)„, and the corresponding approxi¬ 
mated signal power at the destination is given by 


where 


Pi max (Ok = rP ■ c„D.„, 

\<k<n 


= max (Ok,n = 1,2,... ,N . 

\<k<n 


(14) 


(15) 


maxA,^, £[rP-CAr^nivJ 
s.t. Ns is a stopping time,A(s < N a.s. 


(16) 


Assuming rP to be a constant, then ( [T6| ) is equivalent to 
the following optimal stopping problem. 


maxAT, 

s.t. Ns is a stopping time,A(i < N a.s. 


(17) 


^The objective function defined in is an intuitive measure. However 
we will show in Section IV the stopping rule maximizing this particular 
objective achieves full diversity order. 


Vn{x):= max E[c7v,nAfjn„ = 2c] , (18) 

Ns is a stopping time,n<Ns<N 

and we can write down the backward induction for the value 
functions as follows, 

Vn{x) =cn-x, 

y„(x) =max{c„x,E[V'„+i(n„+i)|n„ =x]},n < A- 1. (19) 

From the backward induction, we obtain the structure of the 
optimal stopping time stated in the following theorem. 
Theorem 1: The optimal stopping rule N* for the optimal 


stopping problem described by (171 is given by thresholds 
fi,f2,... ,fAr-i such that 


N* = infjn > 1 : CnTln > ?«}, 


( 20 ) 


where the threshold t„, n — 1,2,...,N — \ is the unique 
solution of the following fixed point equations 


l (^n+1) — ^n]- 

Consequently, the value functions satisfy 


( 21 ) 


( 22 ) 


The relay selection problem defined in this paper is to 
decide when to stop probing the relay channels and which 
probed relay to cooperate with to maximize the expected 
value of the approximated signal power at the destinationj^ 
Formally, we want to choose a stopping time Ns with 
respect to the sequential channel probings and the channel 
realizations {(Os,„,(On.d,n = 1,2,...,A} and we formulate the 
relay selection problem as an optimal stopping problem as 
follows. 


, . _ J c„x if X > f„, 

\ E[y„+i(£2„+i)|£l„=x] ifx<f„. 

Proof: The proof can be found in Appendix-]^ ■ 
Theorem states that the optimal stopping rule N* to 
problem ([17^ we formulated in Section |I^ is described by a 
set of thresholds fi ,f2 ,... based on which we propose 

the relay selection strategy RS_OSR = ... ,£/)!)} as 

follows. The decision d* at each stage n < N — I is given 
by 


d* = 

L-ly, 


Stop and choose Rk if > tn,(Ok = Tl„, 


Continue 


if On^n ^ ^rt: 


(23) 


and at the final stage N 
Choose Rk 


d*N = 


if Q.N > (Os^d, (Ok = Tl„, 
Do not choose any relay if <(Osd- 

(24) 


IV. Pereormance Analysis 
In this section, we analyze the performance of the relay 


selection strategy RS_OSR defined in Section III We show 
in Section IV-A| that RS_OSR achieves full diversity order. 
In Section IV-B| we obtain an upper bound on the expected 
stopping time for RS_OSR. 















A. Diversity Gain 

To compute the diversity gain of the relay selection strat¬ 
egy RS_OSR, we consider the error probability and show that 
RS_OSR achieves full diversity order. This result is stated in 
the theorem below. 

Theorem 2: The relay selection strategy RS_OSR defined 
by (j2^-(24i achieves full diversity order. That is, when N 
relays are available 

log(P.(7)) 


Diversity gain :=— lim 


log( 7 ) 


= (A^+1), 


(25) 


where P( 7 ) is the error probability under RS_OSR with 7 
being the SNR. 

Proof: In order to compute the error probability of our 
relay selection strategy, we first consider the error probability 
when the relay selection stops at stage n and k < n is 
selected. Let Pg „( 7 ,x) be the error probability conditional 
on = X and the relay selection stops at stage «. From 
Appendix [II| there exists constants Am,Bm such that 

1 — r 

Pc,«(7:'^) <^Mexp(-BM^;,- XCnY) 

2 qi 

+AMexp{~BM—xcnY):^QniYix). (26) 

2 q 2 

Now using ( p 6 | l, we try to get an upper bound for the error 
probability when the relay selection stop at stage n. Under 
RS_OSR, ( [ 2 ^ implies that the relay selection stop at stage 
n, n<N if > f„/c„ and R^., k <n is selected when 
(Ok = n„. Consequently, for n < A — 1 we have 


P(error, relay selection stops at time n) 

=P (error, >tnlcn) 

= [ Pe,n{'y^x)dFn^{x) 

< f Qn{Y,x)dFa,Xx) 

hil ^}i 

<Qn{YfnlCn)- (27) 


By (241, RS_OSR selects relay Rk if (Ok — TIn > Oi.rf and 
selects no relay if < ( 0^4 at stage N. From [11, chap. 8 ], 
we know that the error probability if the selection stops at 
stage N is upper bounded by 


where the last equality in (29 1 is true because QnijfnICn) 
is the sum of two exponential functions that are dominated 
by (CGcAr 7 )^(^+*^) when 7 is large. From the upper bound 
the diversity gain can be computed by 

log(Pe(7)) 


Diversity gain = — lim 


>- lim 

7-^00 


log( 7 ) 

log((l+o(7))(CGc;v7)^''^+'>) 


log( 7 ) 


=(A+1). 


(30) 


Theorem states that the relay selection strategy RS_OSR 
achieves the full diversity order of A + 1 with N relays. 


B. Bandwidth Ejficiency 

Let N* be the optimal stopping time in RS_OSR (defined 
by (20i). The expectation of N* can be computed as follows 

mn = 1 +E HK > n) = 1 +E < td) 

n=\ 71=1 

N N 

<l + EP(ii«<G) = l + E(P(®i (31) 

71=1 71=1 

Let tN^max = max„=i^2,...,7v(fn) and pN = P(a)i < tN^ax), then 
from ( [3 T] i we further obtain 

e[a;]<i + E(p(®i <G)r 

71=1 


N 

sEp» = t 

n=0 ^ 




-PN 


(32) 


If 1-Rv > e for some e > 0 , the expected stopping time of 
RS_OSR is bounded for any number of relays. We show in 
the theorem below that it is indeed the case. 

Theorem 3: There exists some positive constant £ > 0 
such that 1 — piv > £ for any N. Consequently, the expected 
number of probing for the relay selection strategy RS_OSR 
described by ([2^-([24| is bounded above by 

E[a;]<^'—(33) 

Proof: Since Viv(x) = c^yx, from we know that 1 
satisfies 


P (error, relay selection stops at time N) 
<(CGcA^ 7 )^(^^'(p(relay selection stops at time N) 
<(CGc^ 7 )-('^+'(, (28) 

where CG is a constant defined in [11, chap. 8 ]. 

The combination of (27 1 and ( [28] l gives the complete error 
probability as follows. 

N 

Pe( 7 ) = E ^(sn'or, relay selection stops at time n) 

77=1 

< E Qn{YfnlCn) + {CGcNY)-^^^'^ 


71=1 


=(1+o(7))(CGcw7)-('^+‘) 


(29) 


CN-Pn-X =E[Vv(£liv)|£lAr_i =fiV-l] 

=E[c 7 vmax(fAr_i,®A,)]. 

Let h{x) = E[max(x,Oti)], we get 

hfxi-i) cn-x l+Nx 


tN-x cn 1 + (A— 1 )t 

h(x\ 


> 1 . 


(34) 


(35) 


Note that the function is strictly decreasing as shown 
below. We first compute the derivative of h{x) as follows. 

h'{x) = (J xdF(aX03)+ J (OdFafo)) 


=xR^j (x) (x) -xF'^^ (x) 

=Efi),(jc), 


(36) 
















where Fa^{x) is the CDF of coi. Moreover, h{x) = 
E[max(;ic,a)i)] >x. Then 

(^) 

= \ (xFonix) - h{x)) 

< \ (x) - x) < 0. (37) 

Define g{x) as the inverse function of then g{x) is also 
strictly decreasing. 

Let t* be the solution to = 1, i.e. t* = g(l). Then, it 
follows form ( [35] l that 

<^(l)=f*- (38) 

Furthermore, we want to show that 


tn < for all « < — 1. (39) 


The proof is done by induction. 

( |39l l is true for — 1. Assume the ( |39| l holds for « + 1. 

For stage n, if f„ < f„+i, we get t„ < f„+i < fiv-i by the 
induction hypothesis. If t„ > f„+i, max{t„,co) > f„+i for any 
(0. From ( |22l l we obtain 

— tfi] —(max(tH, ))] 

=E[c„+imax(f„,®„+i)]. (40) 


Then, from (21 1 for f„ we get 


—L] 

=E[c„+imax(f„,®„+i)] 


Therefore, 



Note that 

Cn l + (n+l)T ^ 1 +At Ctv —1 

Cn +1 1+nT 1 + (A—1 )t Ci\[ 

Since g(x) is decreasing, we have 


tn=g 




<g 


CN -1 

Cn 


= tN-l- 


As a result of the above analysis, we have 


tn<tN-l <t* 


for any « = 1,2,...,A—1. Moreover, 

Pn — P(®i < In ,max^ ^ P(®i<C)<l. 

We can now define e = 1 — P(a)i < t*) > 0, then 
1-P(®i <t*) = £. From (|32| we obtain 


e[a;] < 


^-PN 


< 


1 -e 


(41) 

(42) 

(43) 

(44) 

(45) 

(46) 
PN > 

(47) 


for any total number N of relays. 


From Theorem]^ we know that the expected stopping time of 
the relay selection strategy RS_OSR is bounded. Therefore, 
the expected time for each transmission is bounded by 

E[7;„„(l +a;t)] < 7;„„(1 + T^). (48) 

V. Simulation results 

We simulate a relay networks and present simulation 
results. To implement RS_OSR, we solved the fixed point 
equation (0 in an offline fashion. In particular we use an it¬ 
erative heuristic to solve the equation with the help of Monte 
Carlo sampling (to remedy the computation complexity from 
calculating the conditional expectation in pT)). 

In the simulation we compare our proposed relay selection 
strategy RS_OSR with the optimal relay selection (w.r.t. 
indices = 1 , 2 ,...,A defined by Q) that probes all 

relays, which is denoted by RS_ALL. Note that the optimality 
(can achieve full diversity order) of RS_ALL is proved in 
[ 11 ]. 

We start with comparing the error probability Pg. Figj^ 
and Fig |2 show the comparison for error probability with 
T = 0.05 and T = 0.1 respectively. From Figj^and Figj^we 
observe that RS_OSR achieves full diversity order as com¬ 
parable with RS_ALL. Moreover our algorithm outperforms 
RS_ALL at finer degree consistently. This dues to the gain 
of efficient bandwidth and power saving. We also observe 
that the advantages of RS_OSR is more obvious when T is 
higher. This is intuitively true ; our optimal stopping selection 
strategy helps save sensing time and power and therefore the 
advantage becomes more and more clear when the sensing 
complexity becomes higher. 


SNRn^rm ^e’ 



Fig. 3. BER performance with T = 0.05 


SNR^orm P^, T = 0.1 



Fig. 4. BER performance with T = 0.1 




























Bandwidth efficiency results (rate is measured by the 
average c„ over all sample path) are shown in Figl^ and 
Figl^ with T = 0.05 and T = 0.1 respectively. From Figj^and 
Figjm we conclude that under both cases, the probing time 
of RS_OSR is bounded, while the probing time of RS_ALL 
increases (thus the bandwidth efficiency decreases) as the 
number of relays increases. 



Fig. 5. Bandwidth efficiency with t = 0.05 


rate vs relay number, z = 0.1 



Fig. 6. Bandwidth efficiency with T = 0.1 


VI. Conclusion 

In the presence of non-negligible probing time for getting 
relay channels’ instantaneous quality, obtaining full informa¬ 
tion for the purpose of relay selection leads to an inefficient 
use of bandwidth in cooperative communications. We design 
and implement a stopping rule based relay selection strategy 
RS_OSR and proved its optimality regarding achieving full 
diversity order. Moreover, the probing time for sensing 
relay channels under RS_OSR is shown to remain bounded 
regardless of the number of relay candidates. We establish 
and demonstrate the above two properties by both analytical 
and simulation results. 
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Appendix I 
Proof of Theorem[T] 

Proof: First, we want to prove that the value function 
V„{x) and =x] are convex and increasing. 

The proof is done by induction. 

At the final stage N, the value function Vn{x) = c^x 
is obviously convex (linear) and increasing. Assume that 
Fn+i {x) is convex and increasing. Then, at stage n we have 

E[y„+i(£l„+i)|£l„ =x] =E[y„+i(max(.x,®„+i))|fl„ =x] 

=E[y„+i(max(.x,®„+i))], (49) 

where the last equality holds because (0„+i is independent 
of fi„ (due to the assumption of independence among 
channels). Since Vn+i{x) is convex and increasing and the 
function max(;ic, a)„+i) is convex and increasing in x, by 
the property of convex functions (see [24]) we know that 
their composition y„+i(max(x, a)„+i)) is convex and increas¬ 
ing in X for every a)„+i. Therefore, E[y„+i (n„+i)|n„ = 
x] is also convex and increasing, as it is the the expec¬ 
tation of y„+i(max(x, ®„+i)) over ( 0 „+i. Since y„(x) = 
max{c„x,E[y„+i(f2„+i)|f2„ =x]} by ([^, y„(x) is also con¬ 
vex and increasing. 

With the convexity established above, we know that for 
any n = 1,2,.. .N — 1, E[y„+i(f2„+i)|f2„ = x] and c„x can 
have at most two intersections because £’[y„+i(f2„+i)|fi„ = 
x] is convex and c„x is linear. We show below that they have 
exactly one intersection. For that matter, we want to prove 
by induction 

Vn{x) < C„E[nAr|f2„ =x]. (50) 

Inequality ( |50l l is easily true at stage N. Suppose ( |50| l is true 
for n -f 1. At stage n, since c„ > c„+i we obtain 

y„(x) =max{c„x,E[y„+i(£l„+i)|£l„ =x]} 

< max { c„x, E [c„+ 1 E [Hat I n 1 ) ] I = x]} 

= max{c„x,c„+iE[f2iv|f2„ =x]} 
<max{c„x,c„E[nAr|n„ =x]} 

=C„E[nAr|n„ =x]. (51) 

Then inequality ( |50| l is true at any stage. Consequently, we 
get, as X —>■ oo, 

Vnjx) ^ CnE[nAf|nn = x] 

CnX CfiX 

=E[max(l,"'^''^^)] 

+ (52) 

where the convergence in ( [52| is true because of the fol¬ 
lowing. Note that converges point-wise to 0, and 

^ max,t>nfor X > 1. Since maxi>„a)i; is in Li, 
by dominated convergence theorem converges to 0 

in expectation. 

When X = 0 we have 


From ( [ 52 I 1 and (lU, the linear function c„x is above 
E[y„+i (n„+i)|n„ = x] for large x, and c„x is below 
E[y„+i (n„+i)|n„ = x] when X = 0. Therefore, the two 
functions c„x and E[y„+i(fi„+i)|n„ = x] have exactly one 
intersection. 

Let f„ denote the intersection of c„x and 
E[y„+i(n„+i)|n„ =x], we get 


/ CnX if .r > , 

\ E[y;i-i-i= x] ifx<tn. 


(54) 


As a result of (54i, the optimal stopping time is described 
by ■ 


Appendix II 

Proof of (|^ in Theorem[^ 

Let <t>( 7 ) be the symbol error rate function with SNR 
equals 7 for each node. Since we apply perfect DF protocol, 
there are two cases to consider with. In the first case the relay 
perfectly reconstructs the signal and the destination receives 
both signals from the source and the relay. In the second 
case the relay fails to reconstruct the signal, so only the 
source transmitted signal is received at the destination. The 
probability for the first case is 1 and the one for the 

second case is 0(7^;) where jk = = rcOs^kCnY is the SNR 

at Rj.. The error rate at the destination D is given by ‘i>{Yd)^ 
where Yd is the SNR at D. Yd is can be computed as 

f {rcOs^d + (1 - r)( 0 k,d)cnY in the first case, 

^ \ fCOsdCnY in the second case. 

(55) 

Let Pe,n{Yi^s,kj(>^s,d,(’}kj) be the error probability condi¬ 
tional on the channel parameters 0 )^ jtj ( 0 s,dt( 0 k.d and the relay 
selection stops at stage n and is selected. The error 
probability can be calculated as follows. 


E^,«(7; ^s,kj ^s,dj (^k.d) 

= {l-<P{rCOs^kCnY))^{{rO}s,d + {i-r)(Ok^d)CnY) 

+ ^{r(Os^kCnY)^i>'(Os,dCnY) ■ ( 56 ) 


The error probability calculated in ( [56| is complex. However, 
a proper upper bound can allow us to analyze the diversity 
gain of our relay selection strategy. We proceed to obtain an 
upper bound on the error probability. 

Note that from the definition of ( 0 ^, we have 


(Os,d > 


(Ok 

2 q 2 ’ 


(Ok,d > 


(Ok 

2 qi 


(57) 


c„x = 0 < E[y„+1 ((Oy,+i )] = E[y„+1 (Li„+1 ) \ a „ = 0]. (53) 


Following which we strike an upper bound for (56 1 as 












follows. 


^e,n{Y-) ^s.k-i ^s,d-! ^k.d) 

=(1 - <P{rcOs,kCn7))’i>{{r(0>;,d + (1 - r)cOk^d)cnY) 

+ <P{rCO,,kCnr)'^{rCOs,dCn7) 

<<I>((1 - r)C0k,dCn7) +^{rO)s^kCn7) 

<’i>{\^(OkC„7)+<P{^(OkC„7) 

Iqi lq2 

1 — r r 

- (OkCn7) +AM&^v{-BM2^0)kC„7) . 

2qi Iqi 

(58) 


where Am,Bm are constants depending on the modulation 
scheme as commonly adopted (e.g., [25]) and the last in¬ 
equality in ( [58] l follows from the properties of the error 
probability function ^(y). 

From ( |58| l we have 

1 — r 

<Am^^V(.-Bm^ - XCn7) 

2qi 

+AMexp{-BM—xc„7 ):^Qn{7,x) (59) 

2 q 2 

and (|2^ is established. 





